Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Improvement of numeral recognition using personal handwriting characteristics based on clustering

Identifieur interne : 001892 ( Main/Exploration ); précédent : 001891; suivant : 001893

Improvement of numeral recognition using personal handwriting characteristics based on clustering

Auteurs : Yoshinobu Hotta [Japon] ; Satoshi Naoi [Japon] ; Misako Suwa [Japon]

Source :

RBID : ISTEX:14E9C230458E7425FE2677DC897D30466FCC6E82

Descripteurs français

English descriptors

Abstract

Correctly recognizing characters with peculiarities for each writer is a difficult problem. The process of absorbing variations in individual writing by creating an individual dictionary is also difficult when a writer is not specified and the total number of writers is large. In this paper the authors propose a method to improve the results of isolated character recognition in forms in which the same writer writes many characters by taking the characteristics of the writer's writing on a form as a character distribution in a character feature space. In concrete terms, the authors first perform isolated character recognition on all characters on the same form. Then, based on the results of isolated character recognition, clustering of input character groups is performed for each character category. Clusters which are very likely to include misrecognized characters from isolated character recognition are extracted based on the results of clustering. Then character categories in the extracted cluster are automatically amended based on the distance from all clusters in other categories. In the same fashion, automatic amending is performed for rejected characters. Based on experiments to evaluate handwritten numerals on OCR forms, the authors show that the precision of numeral recognition is improved by using this approach as a form of postprocessing for isolated character recognition. © 2002 Wiley Periodicals, Inc. Syst Comp Jpn, 33(7): 104–113, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.1146

Url:
DOI: 10.1002/scj.1146


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Improvement of numeral recognition using personal handwriting characteristics based on clustering</title>
<author>
<name sortKey="Hotta, Yoshinobu" sort="Hotta, Yoshinobu" uniqKey="Hotta Y" first="Yoshinobu" last="Hotta">Yoshinobu Hotta</name>
</author>
<author>
<name sortKey="Naoi, Satoshi" sort="Naoi, Satoshi" uniqKey="Naoi S" first="Satoshi" last="Naoi">Satoshi Naoi</name>
</author>
<author>
<name sortKey="Suwa, Misako" sort="Suwa, Misako" uniqKey="Suwa M" first="Misako" last="Suwa">Misako Suwa</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:14E9C230458E7425FE2677DC897D30466FCC6E82</idno>
<date when="2002" year="2002">2002</date>
<idno type="doi">10.1002/scj.1146</idno>
<idno type="url">https://api.istex.fr/document/14E9C230458E7425FE2677DC897D30466FCC6E82/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001005</idno>
<idno type="wicri:Area/Istex/Curation">000F55</idno>
<idno type="wicri:Area/Istex/Checkpoint">001006</idno>
<idno type="wicri:doubleKey">0882-1666:2002:Hotta Y:improvement:of:numeral</idno>
<idno type="wicri:Area/Main/Merge">001972</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:02-0303397</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000664</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000128</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000610</idno>
<idno type="wicri:doubleKey">0882-1666:2002:Hotta Y:improvement:of:numeral</idno>
<idno type="wicri:Area/Main/Merge">001A63</idno>
<idno type="wicri:Area/Main/Curation">001892</idno>
<idno type="wicri:Area/Main/Exploration">001892</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Improvement of numeral recognition using personal handwriting characteristics based on clustering</title>
<author>
<name sortKey="Hotta, Yoshinobu" sort="Hotta, Yoshinobu" uniqKey="Hotta Y" first="Yoshinobu" last="Hotta">Yoshinobu Hotta</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Fujitsu Laboratories Ltd., Kawasaki</wicri:regionArea>
<wicri:noRegion>Kawasaki</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Naoi, Satoshi" sort="Naoi, Satoshi" uniqKey="Naoi S" first="Satoshi" last="Naoi">Satoshi Naoi</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Fujitsu Laboratories Ltd., Kawasaki</wicri:regionArea>
<wicri:noRegion>Kawasaki</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Suwa, Misako" sort="Suwa, Misako" uniqKey="Suwa M" first="Misako" last="Suwa">Misako Suwa</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Fujitsu Laboratories Ltd., Kawasaki</wicri:regionArea>
<wicri:noRegion>Kawasaki</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Systems and Computers in Japan</title>
<title level="j" type="abbrev">Syst. Comp. Jpn.</title>
<idno type="ISSN">0882-1666</idno>
<idno type="eISSN">1520-684X</idno>
<imprint>
<publisher>Wiley Subscription Services, Inc., A Wiley Company</publisher>
<pubPlace>New York</pubPlace>
<date type="published" when="2002-06-30">2002-06-30</date>
<biblScope unit="volume">33</biblScope>
<biblScope unit="issue">7</biblScope>
<biblScope unit="page" from="104">104</biblScope>
<biblScope unit="page" to="113">113</biblScope>
</imprint>
<idno type="ISSN">0882-1666</idno>
</series>
<idno type="istex">14E9C230458E7425FE2677DC897D30466FCC6E82</idno>
<idno type="DOI">10.1002/scj.1146</idno>
<idno type="ArticleID">SCJ1146</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0882-1666</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Correlation methods</term>
<term>Experiments</term>
<term>Glossaries</term>
<term>Number theory</term>
<term>Numeral recognition</term>
<term>Optical character recognition</term>
<term>Theory</term>
<term>clustering.</term>
<term>handwritten numerals</term>
<term>personal handwriting characteristics</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Expérience</term>
<term>Glossaire</term>
<term>Méthode corrélation</term>
<term>Reconnaissance optique caractère</term>
<term>Théorie</term>
<term>Théorie nombre</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Correctly recognizing characters with peculiarities for each writer is a difficult problem. The process of absorbing variations in individual writing by creating an individual dictionary is also difficult when a writer is not specified and the total number of writers is large. In this paper the authors propose a method to improve the results of isolated character recognition in forms in which the same writer writes many characters by taking the characteristics of the writer's writing on a form as a character distribution in a character feature space. In concrete terms, the authors first perform isolated character recognition on all characters on the same form. Then, based on the results of isolated character recognition, clustering of input character groups is performed for each character category. Clusters which are very likely to include misrecognized characters from isolated character recognition are extracted based on the results of clustering. Then character categories in the extracted cluster are automatically amended based on the distance from all clusters in other categories. In the same fashion, automatic amending is performed for rejected characters. Based on experiments to evaluate handwritten numerals on OCR forms, the authors show that the precision of numeral recognition is improved by using this approach as a form of postprocessing for isolated character recognition. © 2002 Wiley Periodicals, Inc. Syst Comp Jpn, 33(7): 104–113, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.1146</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
</list>
<tree>
<country name="Japon">
<noRegion>
<name sortKey="Hotta, Yoshinobu" sort="Hotta, Yoshinobu" uniqKey="Hotta Y" first="Yoshinobu" last="Hotta">Yoshinobu Hotta</name>
</noRegion>
<name sortKey="Naoi, Satoshi" sort="Naoi, Satoshi" uniqKey="Naoi S" first="Satoshi" last="Naoi">Satoshi Naoi</name>
<name sortKey="Suwa, Misako" sort="Suwa, Misako" uniqKey="Suwa M" first="Misako" last="Suwa">Misako Suwa</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001892 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001892 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:14E9C230458E7425FE2677DC897D30466FCC6E82
   |texte=   Improvement of numeral recognition using personal handwriting characteristics based on clustering
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024